ABSTRACT

Because of the new military operations other than war, the share of natural language text in
formalized military messages is increasing. To preserve automatic interoperability between different
C4ISR systems, three problems must be solved. First, the natural language text must be automatically
transformed into a formal structure that represents the meaning of the text. Second, this formal
structure must be exchanged between the different C4ISR systems. Third, the representation of the
meaning must be automatically inserted into the C4ISR database. In this paper, we show that
Information Extraction techniques offer a possible solution to the first subproblem. As the formal
structure for representing the meaning, we use typed feature structures.


1.0  INTRODUCTION 

Relevant military information is available in different forms. At one end of the spectrum, there are
highly formalized military messages (cf. [2]). At the other end, there are natural language texts with
little structure (e.g., documents, e-mails, web pages). Because of the new military deployments
(military operations other than war), the share of natural language text in formalized military
reports is increasing. This means that more and more relevant information is available only in
natural language.

Interoperability between different C4ISR systems can be realized by replication (cf. [3]) or by
sending formatted messages. In both cases, the production and processing of the exchanged information
should be fully automatic. However, as the share of natural language text increases, humans must
process the texts before the information can be entered into the databases of the C4ISR systems. This
interrupts the automatic interoperability process.

To remove this interruption, three subproblems must be handled. First, the natural language text must
be automatically transformed into a formal structure that represents the meaning of the text. Second,
the formal structure must be exchanged between the different C4ISR systems. Third, the representation
of the meaning must be automatically processed and the result inserted into the C4ISR database.

In this paper, we propose to use Information Extraction (IE) techniques to solve the first
subproblem. IE is an engineering approach, based on results from computational linguistics, for
building systems that process large amounts of text of a specific sort. Each IE system is tailored to
a specific domain and task. The result of the IE process (the meaning of the text) is represented in
feature-value structures. These feature-value structures, together with an ontology, are the
fundamental mechanism for representing the meaning of the natural language text. The second subproblem
is comparatively easy to solve: instead of the natural language text, the feature structure is sent.
The third subproblem requires a mapping between feature-value structures and relational databases. In
our SOKRATES project, we have developed a component that realizes this mapping into the LC2IEDM
database (cf. [4]).

This paper is structured as follows. First, we introduce an extended view of the interoperability
problem caused by free-form texts. Then, information extraction for producing the formal
representation of meaning (cf. [1], [14]) is briefly introduced. In the main part of the paper, we
show how the natural language part of a military message is represented. The example is taken from
our SOKRATES project (cf. [4], [5], [6], [7], [8], [9], [10], [11], [12], [16], [17]), in which IE
techniques and feature structures are used to represent German battlefield reports.


[Figure: the formatted part of a military report (e.g., "<TRA 123>", "<MOV alb >") and the natural
language part of the report, which is converted into a typed feature structure, are combined into
the whole formalized military report.]

Figure 1: Combination of a formal and a natural language part in a military message


2.0  NATURAL LANGUAGE TEXT AND THE INTEROPERABILITY PROBLEM

Interoperability between different C4ISR systems assumes that the exchanged information is formatted
according to a specific formal standard. In general, this standard is not designed to handle natural
language text as part of military messages. Because of the new military operations other than war,
the share of natural language text in formatted military reports is increasing. Nevertheless, the
information exchange between the C4ISR systems should remain fully automatic. To achieve this, the
natural language text must be transformed into a formal representation of its meaning. This semantic
representation can then become part of the formal standard.

Figure 1 shows the situation in more detail. On the left side, the formal part and the natural
language part of a military report are shown. The written text ("3 BRDM observed ...") must be
transformed into a formal semantic representation. We propose to use Information Extraction (IE)
technology for this step. IE utilizes an ontology (a formal description of the domain concepts) to
produce a formal description of the meaning (a typed feature structure). Both parts of the report are
then available in formalized form and are combined. The whole formalized report is then used in the
interoperability process.


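[Figure: the formalized report is passed between C4ISR systems, either as a formatted message or by
replication.]

Figure 2: Processing of the formalized report in the C4ISR systems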

Figure 2 shows the processing of the whole formalized report. This can be realized in two different
ways. Reports can be interchanged directly by sending the produced formatted message. The other
possibility is the replication of the database content. If the messages are sent, the formal
representation must be transformed into SQL statements in each C4ISR system. For those parts of the
formal representation that originate from the natural language text, the processing must use the
ontology from the IE process. This means that the interoperability of the natural language parts of
the messages creates the new constraint that a common ontology must be used.
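
The transformation of a formal representation into SQL statements can be sketched as follows. This is
only an illustration under stated assumptions: the one-table schema, the column names and the function
to_insert are hypothetical and are not the LC2IEDM schema or the SOKRATES mapping component; the two
input structures mirror the tank structures of the example in Section 4.

```python
import sqlite3

# Hypothetical one-table schema for illustration only (not the LC2IEDM schema).
SCHEMA = (
    "CREATE TABLE equipment_item "
    "(item_type TEXT, abbreviation TEXT, item_count INTEGER)"
)

def to_insert(fs):
    """Turn a flat feature structure (a dict) into a parameterized INSERT."""
    sql = (
        "INSERT INTO equipment_item (item_type, abbreviation, item_count) "
        "VALUES (?, ?, ?)"
    )
    return sql, (fs["TYPE"], fs["ABBREVIATION"], fs["COUNT"])

conn = sqlite3.connect(":memory:")
conn.execute(SCHEMA)

# Two feature structures as they might arrive in a received message.
received = [
    {"TYPE": "TANK", "ABBREVIATION": "brdm-2", "COUNT": 2},
    {"TYPE": "TANK", "ABBREVIATION": "pt76", "COUNT": 1},
]
for fs in received:
    sql, params = to_insert(fs)
    conn.execute(sql, params)

rows = conn.execute(
    "SELECT abbreviation, item_count FROM equipment_item ORDER BY item_count"
).fetchall()
print(rows)
```

In a real system, the feature names and types would have to be resolved against the common ontology
before a target table and column can be chosen; the sketch hard-codes that step.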

In the rest of this paper, one possibility for formalizing the meaning of military reports is
presented. This approach was developed in the SOKRATES project, in which we applied IE to the analysis
of German free-form battlefield reports. The SOKRATES prototype is able to process simple battlefield
reports about movements (cf. [9], [10], [11], [12]). The objective of the SOKRATES project was not to
show the applicability of IE to the interoperability problem described above. Nevertheless, we believe
that the results of our project are a good starting point for finding solutions to this problem.


3.0  INFORMATION  EXTRACTION 

Information extraction (IE) identifies, collects and normalizes relevant information about the Who,
What, When, etc. from natural language texts (cf. [1], [14]). The relevant information is described
through patterns called templates. These domain- and task-specific templates represent the meaning of
the relevant information. During the IE task, the templates are filled with the extracted
information. IE can therefore be seen as the process of normalizing free-form text into a defined
semantic structure.

To realize an IE system, language resources (lexicon, grammar) and appropriate parsing software are
necessary. This software must be language-specific. Thus, IE tools for the English language are not
appropriate for analyzing German texts (due to the free word order of German).

To achieve robust and efficient IE systems, domain knowledge (ontology) must be integrated and
shallow algorithms must be used. The domain knowledge is tightly integrated with the language
knowledge; e.g., the name 'Leopard' in the lexicon carries the categorical information 'tank'. This
association between words and semantic information is domain-specific and has to be changed for other
applications.

The IE process itself is divided into substeps. After the text has been tokenized, the sentence
boundaries are identified. Then, the morphological component identifies the word stems and
abbreviations and detects the syntactic information (e.g., grammatical case and gender). After this,
chunk parsing with transducers selects those parts of the natural language text that are relevant for
the specific information extraction task. The chunks are then used to instantiate the templates,
which represent the result of the IE process.
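
The idea of selecting relevant chunks and filling templates from them can be illustrated with a toy
sketch. It is not the SOKRATES implementation: a single regular expression stands in for
tokenization, morphology and transducer-based chunk parsing, and the lexicon entries, the function
extract_equipment and the abbreviation normalization are assumptions made for this example.

```python
import re

# Hypothetical mini-lexicon: equipment name -> semantic type (assumed entries).
LEXICON = {"brdm 2": "TANK", "pt 76": "TANK"}

# Chunk pattern: a count followed by an equipment name known to the lexicon.
_names = "|".join(re.escape(name) for name in LEXICON)
CHUNK = re.compile(r"(\d+)\s+(" + _names + r")\b", re.IGNORECASE)

def extract_equipment(text):
    """Fill one template per '<count> <equipment>' chunk found in the text."""
    templates = []
    for match in CHUNK.finditer(text):
        name = match.group(2).lower()
        templates.append({
            "TYPE": LEXICON[name],                  # semantic category from the lexicon
            "ABBREVIATION": name.replace(" ", ""),  # simplified normalization
            "COUNT": int(match.group(1)),
        })
    return templates

result = extract_equipment("2 BRDM 2 und 1 PT 76 durchfahren in rascher Fahrt")
print(result)
```

Text that matches no chunk pattern (here, everything after the second equipment name) is simply
ignored, which is the sense in which shallow processing only looks at the task-relevant parts.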


(
  (:TYPE . :MILITARY-ITEM)
  (:LOCATION
    (
      (:COORDINATES . "32upd0290080100")
      (:TYPE . :POINT)
    )
  )
  (:QUALIFIER . :AT)
)

Figure 3: Example of a feature structure


The instantiated templates are a formal description of the meaning of the texts. In our research
project SOKRATES and in this paper, we use typed feature structures as templates. For more details
about information extraction from battlefield reports, refer to [10] and [12].


4.0  FORMALIZING THE NATURAL LANGUAGE PARTS

In this section, we describe in detail how the meaning of the natural language part of a military
report can be formalized. The meaning is represented by typed feature structures. The general feature
structure formalism is widely used in computational linguistics, e.g., in the HPSG approach (cf.
[15]). Each feature structure consists of an unordered set of features and a type declaration. In
Figure 3, the type declaration is given by (:TYPE . :MILITARY-ITEM). All other bracketed terms within
the outermost pair of brackets represent the set of features. Each feature consists of a name (e.g.,
:QUALIFIER) and a value (e.g., :AT). In the simplest case, the value is a number, a character string
or an atomic type. In addition, a feature value can itself be a feature structure; e.g., the value of
the feature :LOCATION is a feature structure of the type :POINT.
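
As an illustration of this recursive definition, the structure of Figure 3 can be held in a nested
dictionary and written back in the bracketed notation. This is a sketch under assumptions: the
function to_sexpr and the convention that strings starting with ":" denote atomic types are
introduced here for the example only.

```python
def to_sexpr(value):
    """Render a nested dict, string or number in the bracketed notation."""
    if isinstance(value, dict):
        parts = []
        for name, val in value.items():
            if isinstance(val, dict):
                # A nested feature structure follows the feature name directly.
                parts.append("(:%s %s)" % (name, to_sexpr(val)))
            else:
                # Simple values are written as a dotted pair.
                parts.append("(:%s . %s)" % (name, to_sexpr(val)))
        return "(" + " ".join(parts) + ")"
    if isinstance(value, str) and value.startswith(":"):
        return value               # atomic type such as :AT
    if isinstance(value, str):
        return '"%s"' % value      # character string
    return str(value)              # number

figure3 = {
    "TYPE": ":MILITARY-ITEM",
    "LOCATION": {"COORDINATES": "32upd0290080100", "TYPE": ":POINT"},
    "QUALIFIER": ":AT",
}
print(to_sexpr(figure3))
```

Because the features form an unordered set, the order in which the dictionary lists them carries no
meaning; only the printed layout depends on it.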

When the IE component processes the natural language part of a military report, the feature
structures come from two sources: they are either part of a lexicon entry or constructed during the
IE process.

During the development of an IE system, an important step is the determination and definition of the
feature structures to be used. Because these structures can be based on each other, they are arranged
in a hierarchy. Figure 4 shows a part of the feature structure hierarchy that is used in the SOKRATES
IE module (the whole hierarchy can be found in [12]). The topmost feature structure is of the type
feature-structure. This structure has no features. A possible subtype of feature-structure is the
feature type object ("A ^ B" means "A is a supertype of B"). This type also has various subtypes; one
of them is the type equipment. The type equipment includes the features that are listed in square
brackets under the type name. For each feature, the name (e.g., hostility) and the feature type of
the possible feature values (e.g., hostility-type) are given. The features are inherited down the
hierarchy, so the types vehicle, weapon and also antitank have the same features as equipment. Some of
the types are atomic and have no features, e.g., all subtypes of nationality-code. If features are
defined for a subtype, these features extend the set of inherited features. If the type of the
feature value is enclosed by "{ ... }", the feature value consists of a set of feature structures of
the indicated type. If the type is enclosed by "< ... >", the value consists of an ordered list of
feature structures.
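
Feature inheritance along such a hierarchy can be sketched as follows. The table TYPES and the
functions features_of and is_subtype are hypothetical names for this example; the type and feature
names are those described for Figure 4, and placing antitank directly under weapon is an assumption.

```python
# Each entry: type name -> (supertype, features defined for this type itself).
# Placing antitank under weapon is an assumption made for this sketch.
TYPES = {
    "feature-structure": (None, {}),
    "object": ("feature-structure", {}),
    "equipment": ("object", {
        "count": "number",
        "hostility": "hostility-type",
        "nationality": "nationality-code",
        "abbreviation": "string",
        "name": "string",
        "qualifiers": "{property-value}",
    }),
    "vehicle": ("equipment", {}),
    "weapon": ("equipment", {}),
    "antitank": ("weapon", {}),
}

def features_of(type_name):
    """Collect own and inherited features by walking up to the root type."""
    features = {}
    while type_name is not None:
        parent, own = TYPES[type_name]
        for name, value_type in own.items():
            features.setdefault(name, value_type)
        type_name = parent
    return features

def is_subtype(sub, sup):
    """True if sub equals sup or lies below it in the hierarchy."""
    while sub is not None:
        if sub == sup:
            return True
        sub = TYPES[sub][0]
    return False

print(sorted(features_of("antitank")))
```

A subtype that defines additional features would simply list them in its own dictionary, which
extends the inherited set as described above.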


feature-structure
  object
    equipment
    [
      count         number
      hostility     hostility-type
      nationality   nationality-code
      abbreviation  string
      name          string
      qualifiers    {property-value}
    ]
      vehicle
      weapon
        antitank

Figure 4: Part of the feature structure hierarchy

The meaning of the natural language part of a military report is represented by a feature structure
whose type is one of the subtypes of the type action. The action type move (see Figure 5) is used to
describe movements on the battlefield. The basic structure of move was adopted from the FrameNet
project (cf. [13]) and adapted to the military requirements. During a movement, objects are moved.
These objects are represented as feature values of the feature "theme". The objects move from a
starting point (source military-item) to an end point (goal military-item) using a path (path
<military-item>). The objects can also be in an area (area {military-item}).


move
[
  qualifiers  {property}
  area        {military-item}
  distance    number
  duration    number
  goal        military-item
  path        <military-item>
  source      military-item
  speed       number
  start-time  time
  theme       theme
]

Figure 5: Feature structure 'move'

The feature "qualifiers" contains descriptions of adverbial phrases of the movement (e.g., "it moves
fast"), provided that these descriptions are not already contained in other features (e.g., in "goal
military-item").

feature-structure
  object
    equipment
    [
      count       number
      ...
      qualifiers  {property-value}
    ]
      vehicle
      weapon
    military-item
    [
      ...
    ]
      facility
      airfield
      barrier
    partner
      multiple-partner
      [
        first-partner   single-partner
        second-partner  partner
      ]
      single-partner

Figure 6: Subtypes of type 'object'


Some of the feature values of the type move are structures of the subtypes of object. Figure 6 shows
these subtypes. Ordnance can be found under equipment. Other military objects and concepts (e.g.,
airfield, barrier) are subtypes of military-item.

To describe movements, concepts of locations and directions are necessary. Directions are formalized
by the type direction (see Figure 7), locations by the type location (see Figure 8) and its subtypes.

feature-structure
  direction
  [
    directed-to   dir-code
    qualifier-to  dir-qualifier
  ]
  dir-code
    east
    north
    northeast

Figure 7: Formalization of directions


feature-structure
  location
  [
    name  string
  ]
    point
    [
      coordinates  string
    ]

Figure 8: Formalization of locations


In the following, we show how the meaning of the natural language part of a military battlefield
report is represented with the feature structures introduced above. The part is:

"2 BRDM 2 und 1 PT 76 durchfahren in rascher Fahrt auf Straße 14-1
die Ortschaft SULJIN HAN (CQ 5458) nach Westen"

("2 BRDM 2 and 1 PT 76 pass through the locality SULJIN HAN (CQ 5458) on the road 14-1 in fast
travel to the west.")

The tables in Figure 9, Figure 10, Figure 11 and Figure 12 show the different feature structures that
the IE produces from the various parts of the text. Each table specifies

•  which text part induces

•  which syntactical phrase and

•  which constructed feature structure (Feature structure A).

Also shown are

•  the syntactical function and

•  the feature structure that becomes part of the move structure (Feature structure B).

Report part   Phrases   Feature structure A
-----------   -------   ------------------------------------------------
2 BRDM 2      NP1       ((:TYPE . :TANK) (:ABB . "brdm-2") (:COUNT . 2))
und           Conj      :SET
1 PT 76       NP2       ((:TYPE . :TANK) (:ABB . "pt76") (:COUNT . 1))

Phrase formed by the whole part: NP
Syntactical function: Subject

Feature structure B:

(:OBJECTS
  (:SET
    (
      ((:TYPE . :TANK) (:ABBREVIATION . "brdm-2") (:COUNT . 2))
      ((:TYPE . :TANK) (:ABBREVIATION . "pt76") (:COUNT . 1))
    )
  )
)

Figure 9: Example - I


The part "2 BRDM 2 und 1 PT 76" is a nominal phrase (NP) that is constructed from two smaller phrases
and a conjunction. For each of these phrases, a feature structure of the type :TANK is produced. The
whole NP is the subject of the sentence. Both tank feature structures are stored as values of the
objects feature, which belongs to the theme feature of the move structure (see Figure 14).

Report part        Phrases   Feature structure A                      Syntactical function
----------------   -------   --------------------------------------   --------------------
durchfahren        VP        (:TYPE . :MOVE)                          Predicate
in rascher Fahrt   PP        ((:VALUE . :FAST) (:TYPE . :PROPERTY))   Adverbial phrase

Feature structure B:

(:TYPE . :MOVE)
(:QUALIFIERS
  (:SET
    (
      ((:VALUE . :FAST) (:TYPE . :PROPERTY))
    )
  )
)

Figure 10: Example - II


The verb "durchfahren" ("to pass through") forms the verbal phrase (VP). This VP is the predicate of
the sentence. The feature type (:TYPE . :MOVE) of the VP is responsible for selecting the correct
action subtype, in this example the move feature structure.

The prepositional phrase (PP) "in rascher Fahrt" ("in fast travel") produces a feature structure of
the type :PROPERTY. This PP is an adverbial phrase, which specifies the VP in more detail (the
movement is fast, not slow). The feature structure of this PP is stored in the qualifiers feature of
the move structure.

Report part                            Phrases   Syn. fct.
------------------------------------   -------   ----------------
auf Straße 14-1                        PP        Adverbial phrase
die Ortschaft SULJIN HAN (CQ 5458)     NP        Object

Feature structure A (auf Straße 14-1):

((:TYPE . :WAY) (:NAME . "14-1") (:QUALIFIER . :ON))

Feature structure A (die Ortschaft SULJIN HAN (CQ 5458)):

(
  (:LOCATION
    ((:COORDINATES . "cq5458") (:TYPE . :POINT))
  )
  (:NAME . "suljin han")
  (:TYPE . :TOWN)
)

Feature structure B:

(:AREA
  (:SET
    (
      ((:TYPE . :WAY) (:NAME . "14-1") (:QUALIFIER . :ON))
      (
        (:LOCATION
          ((:COORDINATES . "cq5458") (:TYPE . :POINT))
        )
        (:NAME . "suljin han")
        (:TYPE . :TOWN)
      )
    )
  )
)

Figure 11: Example - III


The prepositional phrase "auf Straße 14-1" ("on the road 14-1") produces a feature structure of the
type :WAY. It is also an adverbial phrase, and it is stored in the area feature of the move
structure.

The :TOWN structure is also stored in the area feature. This structure results from the report part
"die Ortschaft SULJIN HAN (CQ 5458)" ("the locality SULJIN HAN (CQ 5458)").

Report part   Phrases   Syn. fct.
-----------   -------   ----------------
nach Westen   PP        Adverbial phrase

Feature structure A:

((:QUALIFIER . :TOWARDS) (:DIRECTED-TO . :WEST) (:TYPE . :DIRECTION))

Feature structure B:

(:GOAL
  ((:QUALIFIER . :TOWARDS) (:DIRECTED-TO . :WEST) (:TYPE . :DIRECTION))
)

Figure 12: Example - IV


The phrase "nach Westen" ("to the west") produces a feature structure of the type :DIRECTION. This is
stored in the goal feature.

The meaning of the whole natural language part of the report (all feature structures B) is
represented as a value of type move of the :MESSAGE feature (see Figure 13 and Figure 14). This is the
result of the IE process, which is attached to the formatted part of the military report.
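
How the individual structures B are combined into one move structure under the :MESSAGE feature can
be sketched in a few lines. The function build_move is a hypothetical helper for this example, and
Python lists stand in for the :SET values; the feature names and values are those of Figures 9 to 12.

```python
def build_move(theme_objects, qualifiers, area, goal):
    """Combine the per-phrase structures (B) into one move structure."""
    return {
        "TYPE": ":MOVE",
        "THEME": {"TYPE": ":THEME", "OBJECTS": list(theme_objects)},
        "QUALIFIERS": list(qualifiers),
        "AREA": list(area),
        "GOAL": goal,
    }

tanks = [
    {"TYPE": ":TANK", "ABBREVIATION": "brdm-2", "COUNT": 2},
    {"TYPE": ":TANK", "ABBREVIATION": "pt76", "COUNT": 1},
]
fast = {"VALUE": ":FAST", "TYPE": ":PROPERTY"}
way = {"TYPE": ":WAY", "NAME": "14-1", "QUALIFIER": ":ON"}
town = {
    "LOCATION": {"COORDINATES": "cq5458", "TYPE": ":POINT"},
    "NAME": "suljin han",
    "TYPE": ":TOWN",
}
west = {"QUALIFIER": ":TOWARDS", "DIRECTED-TO": ":WEST", "TYPE": ":DIRECTION"}

move = build_move(tanks, [fast], [way, town], west)
message = {"MESSAGE": [move]}   # the :MESSAGE feature holds a set of actions
print(message["MESSAGE"][0]["GOAL"]["DIRECTED-TO"])
```

Features of move that the report does not mention (e.g., source or distance) are simply left out of
the resulting structure.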

For  more  information  concerning  the  feature  structures  and  the  information  extraction  refer  to  [12]. 

(:MESSAGE
  (:SET
    (
      (
        (:AREA
          (:SET
            (
              ((:TYPE . :WAY) (:NAME . "14-1") (:QUALIFIER . :ON))
              (
                (:LOCATION
                  ((:COORDINATES . "cq5458") (:TYPE . :POINT))
                )
                (:NAME . "suljin han")
                (:TYPE . :TOWN)
              )
            )
          )
        )
        (:QUALIFIERS
          (:SET
            (
              ((:VALUE . :FAST) (:TYPE . :PROPERTY))
            )
          )
        )
        (:TYPE . :MOVE)
        (:GOAL
          ((:QUALIFIER . :TOWARDS) (:DIRECTED-TO . :WEST) (:TYPE . :DIRECTION))
        )

Figure 13: The feature structure of the example - I

        (:THEME
          (
            (:OBJECTS
              (:SET
                (
                  ((:TYPE . :TANK) (:ABBREVIATION . "brdm-2") (:COUNT . 2))
                  ((:TYPE . :TANK) (:ABBREVIATION . "pt76") (:COUNT . 1))
                )
              )
            )
            (:TYPE . :THEME)
          )
        )
      )
    )
  )
)

Figure 14: The feature structure of the example - II


5.0  CONCLUSION 

Because of the new military operations other than war, the share of natural language text in
formalized military reports is increasing. To preserve automatic interoperability between different
C4ISR systems, three problems must be solved. First, the natural language text must be automatically
transformed into a formal structure that represents the meaning of the text. Second, the formal
structure must be exchanged between the different C4ISR systems. Third, the representation of the
meaning must be automatically inserted into the C4ISR database.

In this paper, we first explained the extended view of the interoperability problem. Then, we
introduced Information Extraction technology. In the main part of the paper, we showed that
information extraction techniques can be used to solve the first of the aforementioned subproblems.
For this, we introduced typed feature structures, explained parts of the feature structure hierarchy
and gave an extensive example of how to formalize the natural language parts of a report.


■  Spectrum of relevant military information:

♦  highly  formalized  military  messages,  vs. 

♦  natural  language  texts  with  little  structure  (e.g. 
documents,  e-mails,  web  pages). 

■  The  extent  of  the  natural  language  parts  in  formalized 
military  reports  increases  due  to  the  new  military 
deployments. 

■  The  automatic  interoperability  process  might  be  interrupted 
by  the  human-oriented  processing  of  the  texts. 

This interruption is not acceptable.

To solve the problem, these subproblems must be handled:

♦  The natural language text must be automatically
transformed into a formal structure, which represents the
meaning of the text.

♦  The formal structure must be exchanged between the
different C4ISR systems.

♦  The representation of the meaning must be
automatically processed and the result inserted into the
C4ISR database.


Claim: Information Extraction (IE) can be used to solve the
first subproblem.



■  Information extraction (IE) is the task of identifying,
collecting and normalizing information from natural language
text.

■  Relevant information about Who, What, When, etc. is
looked for.

■  The information of interest is described through domain-specific
lexicon rules and patterns called templates (e.g.,
typed feature structures).

■  During the IE task these templates are filled with the
collected information.

■  The templates are domain and task specific, i.e., for each new
task and domain they must be newly created.

Partial analysis of a military free-form report:

[Figure: the report "09. September 10.45 Uhr von VN-Militärbeobachtern in BIJELJINA ... auf Straße
14-1 ... in rascher Fahrt nach ..." with its parts labelled: when this report was given, who reports,
where, who, what, how, direction.]

[Figure: a report ("... in BIJELJINA: ...") next to the formal representation of its content, drawn
as a tree: a report structure (addressee ..., message) whose message contains a move structure with
the features objects (tank: name "BRDM2, PT76", count ...) and area (way: name "14-1",
location ...).]

[Figure: use of the formal representation. The tree of the report structure (addressee ..., message
containing a move structure with objects (vehicles) and area (location)) is passed on for further
use.]

Representation of meaning by typed feature structures (FS)

■  A FS consists of a set of features (name + value) and a type:

(
  (:TYPE . :MILITARY-ITEM)                  <- type
  (:LOCATION                                <- feature
    (
      (:COORDINATES . "32upd0290080100")
      (:TYPE . :POINT)
    )
  )
  (:QUALIFIER . :AT)                        <- feature
)

feature structure

addressee    partner
medium       medium
message      {action}
speaker      partner
time         time
credibility  credibility

theme        theme
source       military-item
path         military-item
goal         military-item
area         {military-item}
distance     number
start-time   time
duration     number

[Figure: the report "09. September 10.45 Uhr von VN-Militärbeobachtern in BIJELJINA: 2 BRDM 2 und
1 PT 76 durchfahren in rascher Fahrt auf Straße 14-1 ... nach Westen." with its parts labelled: when
this report was given, who reports, where, who, what, how, direction.]

part:     09. September 10.45 Uhr
phrase:   date

feature structure A:

((:TYPE . :TIME) (:MINUTE . 45) (:HOUR . 10) (:MONTH . 9) (:DAY . 9))

feature structure B:

(:TIME
  ((:TYPE . :TIME) (:MINUTE . 45) (:HOUR . 10) (:MONTH . 9) (:DAY . 9))
)

Legend: phrase = syntactical phrase; feature structure A = FS of the phrase; synt. funct. =
syntactical function; feature structure B = in FS move.

part                        phrase
-------------------------   -----------------------
von VN-Militärbeobachtern   who reports
in BIJELJINA:               location of who reports

feature structure A (von VN-Militärbeobachtern):

((:TYPE . :UNIT) (:NAME . "VN-Militarbeobachter") (:ABBREVIATION . "VN-MilBeob"))

feature structure A (in BIJELJINA:):

((:TYPE . :TOWN) (:NAME . "bijeljina") (:QUALIFIER . :EXACTLY-AT))

feature structure B:

(:SPEAKER
  (
    (:TYPE . :UNIT)
    (:NAME . "VN-Militarbeobachter")
    (:ABBREVIATION . "VN-MilBeob")
    (:LOCATED
      ((:TYPE . :TOWN) (:NAME . "bijeljina") (:QUALIFIER . :EXACTLY-AT))
    )
  )
)

FGAN
Forschungsinstitut für Kommunikation, Informationsverarbeitung und Ergonomie
Dr. M. Hecking

FS: Example - IV

part       phrase   feature structure A
--------   ------   ---------------------------------------------------------
2 BRDM 2   NP-1     ((:TYPE . :TANK) (:ABBREVIATION . "brdm-2") (:COUNT . 2))
und        conj     :SET
1 PT 76    NP-2     ((:TYPE . :TANK) (:ABBREVIATION . "pt76") (:COUNT . 1))

synt. funct.: subject

feature structure B:

(:OBJECTS
  (:SET
    (
      ((:TYPE . :TANK) (:ABBREVIATION . "brdm-2") (:COUNT . 2))
      ((:TYPE . :TANK) (:ABBREVIATION . "pt76") (:COUNT . 1))
    )
  )
)

part               phrase   feature structure A                      synt. funct.
----------------   ------   --------------------------------------   ------------
durchfahren        VP       (:TYPE . :MOVE)                          predicate
in rascher Fahrt   PP       ((:VALUE . :FAST) (:TYPE . :PROPERTY))   adverbial

feature structure B:

(:TYPE . :MOVE)
(:QUALIFIERS
  (:SET
    (
      ((:VALUE . :FAST) (:TYPE . :PROPERTY))
    )
  )
)

part                                 phrase   synt. funct.
----------------------------------   ------   ------------
auf Straße 14-1                      PP       adverbial
die Ortschaft SULJIN HAN (CQ 5458)   NP       object

feature structure A (auf Straße 14-1):

((:TYPE . :WAY) (:NAME . "14-1") (:QUALIFIER . :ON))

feature structure A (die Ortschaft SULJIN HAN (CQ 5458)):

((:LOCATION ((:COORDINATES . "cq5458") (:TYPE . :POINT)))
 (:NAME . "suljin han")
 (:TYPE . :TOWN))

feature structure B:

(:AREA
  (:SET
    (
      ((:TYPE . :WAY) (:NAME . "14-1") (:QUALIFIER . :ON))
      ((:LOCATION ((:COORDINATES . "cq5458") (:TYPE . :POINT)))
       (:NAME . "suljin han")
       (:TYPE . :TOWN))
    )
  )
)

part           phrase   synt. funct.
------------   ------   ------------
nach Westen.   PP       adverbial

feature structure A:

((:QUALIFIER . :TOWARDS) (:DIRECTED-TO . :WEST) (:TYPE . :DIRECTION))

feature structure B:

(:GOAL
  ((:QUALIFIER . :TOWARDS) (:DIRECTED-TO . :WEST) (:TYPE . :DIRECTION))
)

Equipment:

■  Examples: "Auto", "Bus", "LKW", "PKW", "Panzer", "Haubitze",
"Wiesel", "Marder", "Leopard", "AML - 90", "BRDM 2", "BM -
22 Mehrfachraketenwerfer 220 mm", "2 feindliche T80",
"Fahrzeug", ...

(
  (:TYPE . :TANK) (:COUNT . 2)
  (:NAME . "t80") (:ABBREVIATION . "t80")
  (:QUALIFIERS
    (:SET
      (
        :HOSTILE
      )
    )
  )
)

Units:

■  Examples: "Infanteriezug", "PzMrsKp (GE) in Wriedel (Zug bei Brockhoefe ...)"

(:SPEAKER
  (
    (:SECOND-PARTNER
      ((:NAME . "Zug") (:TYPE . :UNIT) (:SIZE . :PLT) (:ABBREVIATION . "Zug")
       (:LOCATED
         ((:TYPE . :TOWN) (:NAME . "brockhoefe") (:LOCATION ...) (:QUALIFIER . :AT)))))
    (:FIRST-PARTNER
      ((:NATIONALITY . :GE) (:ARM-CAT . :ARMOUR) (:SIZE . :COY)
       (:NAME . "Panzer Morser Kompanie") (:TYPE . :UNIT)
       (:LOCATED
         ((:TYPE . :TOWN) (:NAME . "wriedel") (:QUALIFIER . :EXACTLY-AT)))))
    (:TYPE . :MULTIPLE-PARTNER)
  )
)

Locations:

■  Examples: "CQ 123456", "BANOVICI (YK 1234)", "die
Straßenkreuzung kr3", "kr5 (CQ 072368)", "kr3", "Stellung
1.3 (CQ 072368)", "Straßenkreuzung (CQ 072368) ... MILESKIJ (CQ 0737)", ...

(
  (:QUALIFIERS
    (:SET
      (
        ((:TYPE . :TOWN) (:NAME . "mileskij") (:QUALIFIER . :SOUTH-OF)
         (:LOCATION ((:COORDINATES . "cq737") (:TYPE . :POINT))))
      )
    )
  )
  (:LOCATION ((:COORDINATES . "cq72368") (:TYPE . :POINT)))
  (:TYPE . :CROSSING)
)


Nominal phrases:

■  Examples: "1 LKW", "neun Busse", "1 beschädigter Panzer", "1
serbisches Fahrzeug", "1 großer LKW", "1 eigener Leo", "1
zerstörter T80", "11 kroatische Fahrzeuge", "zwölf wiesel 1
tow", "Vier alte leopard 1a5", "2 AML - 90 und 1 BRDM 2", "2
feindliche T80", "1 funktionsuntüchtiger bm - 22 ..."

(
  (:TYPE . :TANK)
  (:ABBREVIATION . "bm22")
  (:QUALIFIERS (:SET (:DAMAGED)))
  (:COUNT . 1)
)

Prepositional phrases:

■  Examples: "bei Ebstorf (CQ 123456)", "bei kr7
(32UPD0290080100)", "nach Osten", "in Richtung
Straßenkreuzung (CQ 072368)", "in Stellung 1.3 (CQ
123456)", "von Ebstorf (CQ 123456) über Banovici (YK
45678) nach Berlin (CQ 56789)", "auf Straße 14 - 1", "bis
Ebstorf (CQ 123456)", ...

(
  (:TYPE . :TOWN)
  (:NAME . "ebstorf")
  (:LOCATION ((:COORDINATES . "cq123456") (:TYPE . :POINT)))
  (:QUALIFIER . :THROUGH)
)

Other phrases:

■  Examples: "östlich der Straßenkreuzung (CQ 072368)",
"... schneller Fahrt", ...

(
  (:TYPE . :TOWN)
  (:NAME . "mileskij")
  (:LOCATION
    (
      (:COORDINATES . "cq737")
      (:TYPE . :POINT)
    )
  )
  (:QUALIFIER . :IN-THE-NORTH)
)

■  The extent of the natural language parts in formalized
military reports increases.

■  To assure an automatic interoperability process, the natural
language parts must be automatically transformed into a
formal meaning representation.

■  Information Extraction and typed feature structures are one
possibility to realize this automatic transformation.

■  In the main part of the talk, an example for the formalization
of the natural language parts was given.